AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Swin-BART architecture

# Swin-BART architecture

OCR DocVQA Donut
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for document visual question answering tasks.
Image-to-Text Transformers
O
jinhybr
240
13
OCR Donut CORD
MIT
Donut is an OCR-free document understanding model based on Swin Transformer visual encoder and BART text decoder, this version is fine-tuned on CORD receipt dataset
Image-to-Text Transformers
O
jinhybr
1,130
206
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase